Detecting and classifying outliers in big functional data
نویسندگان
چکیده
We propose two new outlier detection methods, for identifying and classifying different types of outliers in (big) functional data sets. The proposed methods are based on an existing method called Massive Unsupervised Outlier Detection (MUOD). MUOD detects classifies by computing each curve, three indices, all the concept linear regression correlation, which measure outlyingness terms shape, magnitude amplitude, relative to other curves data. ‘Semifast-MUOD’, first method, uses a sample observations while ‘Fast-MUOD’, second point-wise or $$L_1$$ median indices. classical boxplot is used separate indices from those typical observations. Performance evaluation using simulated show significant improvements compared MUOD, both computational time. that Fast-MUOD especially well suited handling big dense datasets with very small time methods. Further comparisons some recent also superior comparable accuracy apply weather, population growth, video
منابع مشابه
Detecting outliers in non-redundant diffraction data.
Outliers are observations which are very unlikely to be correct, as judged by independent observations or other prior information. Such unexpected observations are treated, effectively, as being more informative about possible models, so they can seriously impede the course of structure determination and refinement. The best way to detect and eliminate outliers is to collect highly redundant da...
متن کاملDetecting Outliers in Exponentiated Pareto Distribution
In this paper, we use two statistics for detecting outliers in exponentiated Paretodistribution. These statistics are the extension of the statistics for detecting outliers inexponential and gamma distributions. In fact, we compare the power of our test statisticsbased on the simulation study and identify the better test statistic for detecting outliers inexponentiated Pareto distribution. At t...
متن کاملDetecting and tracking regional outliers in meteorological data
Detecting spatial outliers can help identify significant anomalies in spatial data sequences. In the field of meteorological data processing, spatial outliers are frequently associated with natural disasters such as tornadoes and hurricanes. Previous studies on spatial outliers mainly focused on identifying single location points over a static data frame. In this paper, we propose and implement...
متن کاملDetecting Projected Outliers in High-Dimensional Data Streams
In this paper, we study the problem of projected outlier detection in high dimensional data streams and propose a new technique, called Stream Projected Ouliter deTector (SPOT), to identify outliers embedded in subspaces. Sparse Subspace Template (SST), a set of subspaces obtained by unsupervised and/or supervised learning processes, is constructed in SPOT to detect projected outliers effective...
متن کاملDetecting outliers in multivariate data while controlling false alarm rate
Outlier identification often implies inspecting each z-transformed variable and adding a Mahalanobis D2. Multiple outliers may mask each other by increasing variance estimates. Caroni & Prescott (1992) proposed a multivariate extension of Rosner’s (1983) technique to circumvent masking, taking sample size into account to keep the false alarm risk below, say, α = .05. Simulations studies here co...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Advances in data analysis and classification
سال: 2021
ISSN: ['1862-5355', '1862-5347']
DOI: https://doi.org/10.1007/s11634-021-00460-9